Toward portable information extraction

Author

  • Mihai Valentin Tablan


Similar resources

Towards Translingual Information Access Using Portable Information Extraction

We report on a small study undertaken to demonstrate the feasibility of combining portable information extraction with MT in order to support translingual information access. After describing the proposed system's usage scenario and system design, we describe our investigation of transferring information extraction techniques developed for English to Korean. We conclude with a brief discussion ...


Information Extraction Tools for Portable Document Format

Interest in the new publishing phenomenon known as the e-book has grown enormously in the last few years. There are now at least 150 companies involved in various ways in the development of e-books. Despite this involvement, the spread of e-books has not yet proved useful in the implementation of digital libraries. The use of e-books in PDF format in the implementation of a digital library requires a robust informat...


Simple Information Extraction (SIE): A Portable And Effective IE System

This paper describes SIE (Simple Information Extraction), a modular information extraction system designed with the goal of being easily and quickly portable across tasks and domains. SIE is composed of a general-purpose machine learning algorithm (SVM) combined with several customizable modules. A crucial role in the architecture is played by Instance Filtering, which makes it possible to increase effici...


PIA-Core: Semantic Annotation through Example-based Learning

This paper summarizes the aims and scope of the PIA (Portable Information Access) project's PIA-Core system for automatic annotation of documents on the Semantic Web, i.e. the next-generation World Wide Web. The focus of the project is to develop a portable information extraction system that can be easily adapted to new domains. PIA is founded on three resources: the PIA-Core informati...


TAO: System for Table Detection and Extraction from PDF Documents

Digital documents present knowledge in most areas of study, exchanging and communicating information in a portable way. To better use the knowledge embedded in an ever-growing information source, effective tools for automatic information extraction are needed. Tables are crucial information elements in documents of scientific nature. Most publications use tables to represent and report concrete...



Publication date: 2009